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LEADING DIGIT LAWS ON LINEAR LIE GROUPS 


COREY MANACK AND STEVEN J. MILLER 


Abstract. We determine the leading digit laws for the matrix components of a linear Lie group 
G. These laws generalize the observations that the normalized Haar measure of the Lie group K+ 
is dx/x and that the scale invariance of dx/x implies the distribution of the digits follow Benford’s 
law, which is the probability of observing a signihcand base B of at most s is logg(s); thus the 
hrst digit is d with probability log 5 (l + 1/d)). Viewing this scale invariance as left invariance of 
Haar measure, we determine the power laws in signihcands from one matrix component of various 
such G. We also determine the leading digit distribution of a hxed number of components of a unit 
sphere, and hnd periodic behavior when the dimension of the sphere tends to inhnity in a certain 
progression. 


1. Introduction 


1.1. Background. Given a positive number x and a base B > 1, we may write x = Sb{x)B^^^\ 
where Sb{x) e [1, B) is the signihcand and k{x) E Z. The distribution of SbIx) has interested 
researchers in a variety of helds for over a hundred years, as frequently it is not uniformly dis¬ 
tributed over [l,i?) but exhibits a profound bias. If Prob(S' 5 (a;) < s) = log^(s) we say the 
system follows Benford’s law, which immediately implies the probability of a hrst digit of d is 
\ogQ{d + 1) — log 5 ((i) = log^(l + l/d) (at least if d + 1 < B); in particular, base 10 has a 
hrst digit of 1 about 30% of the time, and 9 for only around 4.5% of the values. This bias was 
hrst observed by Newcomb [INewll in the 1880s, and then rediscovered by Benford [IBenll nearly 50 
years later. 

Many systems follow Benford’s law; on the pure math side these include the Fibonacci num¬ 
bers (and most solutions to linear recurrence relations) HBrDuH . iterates of the 3x-i-l map UKonMii 


LagSol, and values of L-functions on the critical strip among many others; on the applied side 


examples range from voter and hnancial data HMebl |Nig[ | to the average error in boating point 
calculations HKnul . See IIBH3[ IMilll for two recent books on the subject, the latter describing 
many of the applications from detecting fraud in taxes, images, voting and scientihc research, 
IIBH2[ IDia[ IHili IHi2l IPini IRaiH for some classic papers espousing the theory, and HBHll IHull for 
online collections of articles on the subject. 

Our purpose is to explore the distribution of leading digits of components chosen from some 

random process. We concentrate on two related systems. The hrst are various n x n matrix 

2 

ensembles, which of course can be viewed as vectors living in . The second are components of 
a point uniformly chosen on a unit sphere, which turn out to imply results for some of our matrix 
ensembles. 
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Following the work of Montgomery UMonH . Odlyzko HOdlilOdfll . Katz-Sarnak [|KaSalllKaSa2ll . 
Keating-Snaith IIKeSnll[KeSn2[lKeSn3H . Conrey-Farmer-Keating-Rubinstein-Snaith HCFKRSB and 
many others, random matrix ensembles in general, and the elassieal eompaet groups in partieular, 
have been shown to suoeessfully model a variety of number theory objeets, from speeial values to 
distribution of zeros to moments. In some number theory systems Benford’s law has already been 
observed (sueh as values of L-funetions in UKonMiL or values of Fourier eoeffieients in HAnRoStU k 
thus our work ean be interpreted as providing another explanation for the prevalenee of Benford’s 
law. 

We first quiekly review some needed baekground material and then state our results. 


1.2. Haar Measure Review. Random matrix theory has enjoyed numerous sueeesses over the 
past few deeades, sueeessfully modeling a variety of systems from energy levels of heavy nuelei to 
zeros of L-funetions HBFMT-B I IFiMill IHaH . Early work in the subjeet eonsidered ensembles where 
the matrix element were drawn independently from a fixed probability distribution p; this of eourse 
led to questions and eonjeetures on how various statisties (sueh as spaeings between normalized 
eigenvalues) depended on p. For example, while the density of normalized eigenvalues in matrix 
ensembles (Wigner’s semi-eirele law) was known for all ensembles where the entries were ehosen 
independently from niee distributions, the universality of the spaeings between adjaeent normal¬ 
ized eigenvalues resisted proof until this eentury (see, among others, [|ERSY[|ESY[|TVlllTV2ll f. 

Instead of ehoosing the matrix elements independently and having to ehoose a p, we ean eonsider 
matrix groups where the Haar measure gives us a eanonieal ehoiee for randomly ehoosing a matrix 
elementQ On an n-dimensional Lie group G there exists a unique, non-trivial eountably additive 
measure p whieh is left translation invariant (so p{gE) = p{E) for all g E G and E a Borel set); 
/i is ealled the Haar measure. If our spaee is eompaet we may normalize p so that it assigns a 
measure of 1 to G and thus may be interpreted as a probability. See HHRII for more details on the 
Haar measures and Lie groups. 

We are espeeially interested in the ease where G C GL(I/) is a eonneeted linear Lie group; 
we take pij to be the projeetion of G onto the f, j-th eoordinate and study the distribution of the 
leading digits. Lor many G the resulting behavior is easily determined, and follows immediately 
from the observation that a system whose density is on [1, B) follows Benford’s law (see 

definition ll.il) and Theorem 1 1.41) . After introdueing some terminology, we state five eases whieh 
are immediately analyzed from the Haar density; Theorem 11.71 plays a key role in our later work 
(Theorem 11.81) . These theorems are interpretations of Haar measure deeompositions of elassieal 
noneompaet G (see HHRII f. Care must be taken to separate the notion of digit law for the eompaet 
and noneompaet eases sinee many noneompaet G do not posses a G invariant probability measure. 
So we have two definitions of leading digit law: for noneompaet G, we average the measure of 
signifieands over a neighborhood of a speeifie one-parameter subgroup (see Definition 11.21 for a 
preeise statement). If G is eompaet, the Haar measure affords a global average over all matrix 
elements. So one may think of the noneompaet digit law as a loeal, and the eompaet digit law as 
global (see Definition 1 1.1 01) . 


'These are the ensembles that turn out to be most useful in number theory, not the ones arising from a fixed 
distribution. 
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Definition 1.1. Given a base B E N, a digit law is a probability density function -0 : [1, B) -E 
[0,1]. A digit law satisfies a {B, k) power law (for k > 0) if 


= fkix) 


B^-^ - 1 1 

(k — \)B^~^ ’ 


( 1 . 1 ) 


and is B-Benford if 


'f(x) = fi(x) 


1 1 
logi? X 


( 1 . 2 ) 


Notice that limfc_^i fk^x) = f!i(x). 


Definition 1.2. Given a connected, noncompact, locally compact Lie group G with Lie algebra 
L(G), a unit direction X G L(G) which generates a one parameter subgroup x = x(t) = exp(tX) 
of G, a base B > t), a positive measure p on G, and probability function -0 : [1,5) -E [0,1] w 
say that (G, dp, x) satisfies the digit law rjj if the following holds: If we let 

UfiX) = [Y + X E L{G) \ Y LX, \Y\< e} (1.3) 


be the disk of radius e containing X that is orthogonal to X in L(G), we have 


Prob(S'B(x) < s) 


EtoV(exp(5,([log5^1og5^g)X))) 
k^ooe^O p(exp(17e([0, log5*^)X))) 


(where k is a positive integer). 


f(t) dt 


(1.4) 


Remark 1.3. The above definition, though somewhat involved, captures the essence of leading 
digit law by averaging p in the direction of X according to the significands base B. Since we are, 
in many cases, averaging the Haar measure in a specific direction, we find digit laws in components 
of matrix groups which are not amenable (SL 2 (M), e.g.) 


By the Baker-Campbell-Hausdorff formula, the averaging eondition (11.41) is equivalent to 
Prob(S«W<s) = E£„V(exp(logB'X)exp(C/.(|0.1og.)X))) 

k^ooe->-0 


p(exp(l7,([0,klogB)X))) ' 

We typically take p to be the left or right invariant Haar measure on G. If p is left or bi invariant, 
(11.51) beeomes 

kp{exp{Ufi[Q,\ogs)X))) 


Prob(S'B(a;) < s) = lim lim 


k^oo e ^-0 /i(exp([/e([0, fclog5)X))) 


( 1 . 6 ) 


Theorem 1.4. (M+, dx/x) is B-Benford. 


Proof. As the Lie algebra L(W') = M of W' is one dimensional, the perpendieular subspaee to M 
is {0}, Thus for any s E [1, B), one has Ue{[0, logs)X) = [0, logs)X, whenee (11.61) becomes 


Prob(S'B(2f) < s) 


lim 

k^oo 


k dx/x 
If' dx/x 


k log s 
/dog 5 


logs 5. 


(1.7) 


□ 


In the spirit of Theorem 1 1.41 when the Haar density deeomposes as a produet of densities on the 
matrix components, as it does in the next three theorems, the digit laws are easily determined from 
formulation (11.61) . 
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Theorem 1.5. Let G = P be the group of real-valued upper triangular matrices: 



f 

ail 

ai2 . 

Clin 

1 

p- 


0 

^22 • 

Cl2n 

, an G R/{0} / 



. 0 

0 . 


J 


The leading digit law of An for the left invariant Haar density dgi is 

• B -Benford when i = j = 1, 

• a {B, k) power law when i = j = k, 2<k<n, 

• uniform for 1 < i < j < n. 

The leading digit law of an for the right invariant Haar density dgR is 

• B -Benford when i = j = n, 

• a {B, n — k) power law when i = j = k, 2<k<n, 

• uniform for 1 < i < j < n. 

Proof The left invariant Haar measure on P has density 


dgi 


1 

®11®22 ■ ■ ■ ®nn 


i<j 


and the right invariant Haar measure on P has density 


dgR 


® 11®22 ■ ■ ■ ^nn 


dttij , 


i<j 


where doij is the Lebesgue density on M in both oases. All leading digit laws follow. 


( 1 . 8 ) 


(1.9) 


( 1 . 10 ) 

□ 


Theorem 1.6. Let D be the group of real-valued diagonal matrices: 

'ail ■■■ 0 


D = 


0 


, an G M/{0} 


( 1 . 11 ) 


For each i between 1 and n, the leading digit law of an with respect to the bi-invariant Haar density 
dg is B-Benford. 


Proof The bi-invariant Haar measure on D is 

dg = - daiida22 ■ ■ ■ dann, 

ail022 • • • a-nn 

where dan is the Lebesgue measure on R. The digit laws follow. 

Theorem 1.7. Let Di be the group of real-valued, determinant 1 diagonal matrices: 

ail ■ ■ ■ 0 


Di = 


0 


) ^ I ’ 


( 1 . 12 ) 

□ 

(1.13) 


For each i between 1 and n, the leading digit law of an with respect to the bi-invariant Haar density 
dg is B-Benford. 
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Proof. Di'is diffeomorphic to the graph of 


1 


(Oll, . . . , 


I —y - 

CtllCi22 ■ ■ * 0.n-l,n-l 


(1.14) 


and hence is diffeomorphic to an open sub-manifold of R” ^ The bi-invariant Haar measure on 
Di is thus 

dg = - d(X\\dQj22 ' ' ' dOnni ( 1 - 15 ) 

011^22 • ■ ■ 0.n-l,n-l 


where dan is the Lebesgue measure on R. The digit laws follow. 


□ 


1.3. Main Results. Our first result concerns the distribution of entries from SL„(R). Denote by 
L, U, Di C G the subgroups of unipotent lower triangular, unipotent upper triangular, and diagonal 
subgroup of SL„(R). Then g G G can be uniquely expressed as g = lud, I E L,u E U,d e Di. 
Note that each of L, U, Di is topologically closed in SL„(R), and hence each is a Lie subgroup of 
G. If I, u, hi be the Lie algebras of L, U, D respectively then [, u, hi have the vector space basis 
(which we review in Appendix |A|): 

( ^ 1) (1-16) 
where Eij is the n x n matrix with 1 in the {i,j) position and zeroes elsewhere. 

Theorem 1.8. Let dg be the normalized Haar measure on SL„(R), f E Gc{G). Then 




0(exp(X) exp(F)a) dadXdY, 


1 Ju Jd 


where dX, dY are the Lebesgue measures on (, u and 


n—1 

da = 

i=l 


diCL'ij^ 


(1.17) 


(1.18) 


is the Haar measure on Di. Consequently, the joint distribution of diagonal components is a 
product of B-Benford measures. 


The next corollary follows immediately from the invariance of dg on S'L„(R): 

Corollary 1.9. Let P,Qe SL„(R) be even order permutation matrices. For A E SL„(R), the 
joint distribution of the diagonal components of PAQ are a product of B-Benford measures. 


In other words, the joint distribution of n components is a product of 5-benford measures if there 
is an even permutation of the rows and columns which sends the n components to the diagonal 
components. As an immediate consequence of the above, we obtain results on the behavior of 
determinants of matrices from GL„(R)+ (Theorem IB.31) . For other results related to Benford’s law 
and matrices, see iiBqi, who prove that as the size of matrices with entries i.i.d.r.v. from a nice 
fixed distribution tends to infinity, the leading digits of the n! terms in the determinant expansion 
converges to Benford’s law. Also see [IBH3II for results arising from powers of fixed matrices. 

When G is compact, the Haar measure may be normalized to be an invariant probability measure 
on G, affording a global definition of digit law, stated next. 
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Definition 1.10. Fix a base 5 > 0. Let Gbea compact connected Lie group, p a positive countably 
additive probability measure on G, f ■. G ^ M. measurable. We saw that (G, p, /) satisfies the 
digit law if 

FToh{SB{f{g)) < s) = dx. (1-19) 

We shall see that when G = 0{n) or U (n), / is a projeetion of G onto the {i, j)-th eomponent 
and p is Haar, the digit laws eome as a eonsequenee of digit laws from a point drawn at random 
from a unit sphere (see Corollary II .141) . So our next result yields digit laws for eomponents of a 
point drawn at random on an n-dimensional sphere of radius r: 

S'^{r) := {x e : |x| = r}. (1.20) 

We adopt the notational convention for the unit sphere: S'” := S'"'(l). 


Theorem 1.11. Let xi be the first component of an x G S'” chosen uniformly at random. We have 
fori < a < b < B that 


Prob(a < S'b(xi) < b) 


2 r(n/2 + l/2) 
ySr r(n/2) 



(1 - 


dxi. 


( 1 . 21 ) 


As n —)■ oo, Stirling’s formula implies the above eonverges to integrating a Gaussian density, 
where erf is the standard error funetion: 


erf(a;) 



( 1 . 22 ) 


Lemma 1.12. Fix a base B > 1 and 1 < a < b < B. Let xi and x be as in Theorem \1.11\ As 
n ^ oo, Prob(a < Sb{xi) < b) is well-approximated by 


in the sense that 


2 


/TT 



dx = 


2=1 




(1.23) 


lim 

n^oo 


OO 

Prob(a < S's(a;i) <b) — 



dx 


0 . 


(1.24) 


i=i V V 2 ■gr 

Remark 1.13. Lemma 17.721 has an interesting consequence. First, consider the sequence of 
spheres S'”^ , £ G N. For n sufficiently large, with > 4, Then 


Prob(a < ^b(xi) < 6, X G ^ 


^erf 

2=1 


( InB^^ b 
2 & 


— erf 


[ IriB^^ a 
2 & 



(1.25) 


By choice of n, the additional terms from extending the sums to all i are more than 4 standard 
deviations from the mean, and contribute negligibly to the sum in the limit. Hence for n sufficiently 
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Figure 1. The distribution of the first digits base B = 10 of the first eom- 
ponent of points uniformly chosen on a sphere with n components. Top row: 
n e {100,200,500}. Bottom row: n G (10000,20000,50000). Notice the pe¬ 
riodicity when n increases by a factor of = 100. 


large, 


\im Prob(a < Sb{xi) < 5, x G erf | y 


n b 


2 5 * 




(1.26) 


JD2i 

For fixed n G N, it follows that the leading digit law of Xi in S , as i ^ oo, tends to the digit 
law Fn : [1, B) —)■ [0,1) whose cumulative distribution function is given by 




n 1 


2 B^ 


(1.27) 


As Fn{x) = FnB^ far any n E N, it follows that leading digit law of Xi in k ^ oo, falls into 
the periodic cycle of B"^ — 1 limiting digit laws Fn, 1 < n < B"^ as defined in (11.271) . We plot a 
representative set ofn in Figure\T\ 


Lemma [1. 121 and its consequences can be generalized to a fixed number of components; we do 
this in Lemma [TTI 

The spherical digit law in one component (Lemma [1.121) yield digit laws for the compact matrix 
group (9„(R), stated next. 


Corollary 1.14. The leading digit law in the {i, j) component o/0„(M) with respect to Haar is the 
leading digit law ofxi in with respect to the uniform measure. 


Proof. As On(R) contains every permutation matrix P G GL„(M), there exist permutation ma¬ 
trices P,Q E GLn(M) such that PAQ E (9„(M) sends the {i,j) entry to the (1,1) entry. By 
invaraiance of dg, it suffices to prove the Corollary for the (1,1) component of (9n(M). Recall that 
any matrix A E 0„(M) satisfies A'^A = I, so the columns of 0„(M) form an orthonormal basis 
of We may therefore embed 0„(M) in the product of n spheres x • ■ ■ x Consider 
the construction of a matrix in 0„(M) one column at a time from left to right. The first column ci 
can be selected arbitrarily from 5'"“^. The second column C 2 is a vector selected in the orthogonal 
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plane to ci in a set whieh is isometrie to In general the ith eolumn is seleeted in the 
orthogonal hyperplane to ci,..., Cj_i in whieh is a set isometrie to S'””* Sinee the 0„(M) 
aetion on a subset A c On(M) preserves the Haar measure of A, there is a measure preserving 
transformation between a basis for the Haar measurable sets of 0„(M) and measurable subsets 
Ai X A 2 ,An C S*””^ X S*””^ X S'® equipped with the uniform measure on S'*. Therefore, 
the digit law of the (1,1) eomponent of 0„(M) is equal to the digit law of S'””^ with the uniform 
measure. The leading digit law follows. □ 

Thus one sees the same asymptotie periodieity in the leading digit laws in the {i,j) eomponent 
of 0„(M) with period — 1 in n. By invarianee of dg, it follows that Lemma 11.14113.11 and 
formulas therein yield leading digit laws for a fixed number of eomponents of 0„(M), so long 
as all eomponents lie in the same row or eolumn. Lastly, analogous digit laws for the real an 
imaginary parts in a fixed number of eomponents U„(C) are immediate, sinee U„(C) eontains 
every permutation matrix and the first eolumn of Un(C) is a point on 

Remark 1.15. We leave the leading digit laws of a hyperbola as future research. 


We prove Theorem 1 1.8 1 on eomponents of SL„(M) in §l2](see also Appendix |B] for a more geo- 
metrie proof in two dimensions), and then Theorem 1 1.1 ll in ^ diseussing some additional eonse- 
quenees (we have already shown above how it yields digit laws for the elassieal eompaet groups). 
We then finish with some eoneluding remarks and thoughts on future researeh. 


2. Proof of Theorem 1 1.8 1 


Let L, (/, Di be lower, upper, and diagonal matriees determinant 1 matriees, I, u, hi be as before; 
we ean ealeulate the density of dg with respeet to the deeomposition G = LUDi. Piek any g^ in 
G, and parametrize in a neighborhood of g^ using exponential eoordinates 


g = g{X, y, Z) = go exp X exp Y exp Z. 


( 2 . 1 ) 


If we let 

g{t) = g{tX,Y,Z) = goexptX expY exp Z (2.2) 

where XgI, Lgu, Zehi, then l + u + hi = q. It follows that the derivative at go in the direetion 
of X is 


g'{f) = go{exptX)X expY exp Z, (2.3) 

so that 


g{t) ^g'{t) = {goexptX expY exp Z) ^go{exptX)X expY exp Z 
= Ad(expZexpy)-i(A:) = 


(2.4) 


By a ehange of variables and left invarianee, the differential with respeet to eoordinate bases of 
[, u, h is given by the bloek matrix 


■ [Ad(expZexpy)-i(X)]( 

0 

0 

* 

[Ad(expZ)-^(y)]u 

0 

* 

* 

■^01 
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where [Ad(exp Z exp Y) ^(X)][ is the part of Ad(exp Z exp Y) ^(X) that lies in the subspace 1. 
Thus, the volume element on G = UJDi in a neighborhood of go is given by 

I det Ad((Md)“^)[|| det Ad((d)“^)u| (2.6) 

and is independent of go- By Fubini’s theorem, <P{g)dg is 



(/)(exp(X) exp(F) exp(Z))| det Ad((M(i) )[| | det Ad(((i) )u| dZdXdY. (2.7) 


'[ Ju ^01 

Using the ordinary basis {Eij}i^j of u, the adjoint action of the diagonal subgroup 77 on u is 

dj 


Ad(d = d ^Eijd = -^Eij. 


It follows that (with respect to exponential coordinates of the first kind) 

detAd(d“^)« = 




l<i<j<n ** 

With respect to the basis {Eijjiyj of I, one can see that 

Ad{{ud)~^)Eij = {ud)~^Eijud 

takes the block form 


0i,n-j 

* 

On—2,n—j 

On— 


Ordering the basis of I along sub-diagonals, Ad((M(i) ^)[ becomes upper triangular, with 

Ad(M“^)( = id( 


( 2 . 8 ) 

(2.9) 

( 2 . 10 ) 

( 2 . 11 ) 

( 2 . 12 ) 


for all u e R. Therefore, | det Ad((Md) ^)[| = | det Ad((i ^)(|, and the adjoint action of 77 on (is 
simply 


di’ 


Ad{d-^),E,, = d-^E,,d = {d^EY{d-^yy = iAd{d)E,,y = ^E,,. 


(2.13) 


Therefore 

I det Ad((M(7)“^)(| I det Ad(((7)“^)u| = 1 (2.14) 

and Theorem [T77] completes the proof. □ 

We provide another proof of Theorem 11.81 through a geometric approach, based on the area of 
the hyperbolic sector, in Appendix |Bj 


3. Proof and Consequences of Theorem II.1 II 

For r > 0, let 

5”(r) = {x e kl = ^} (3-1) 

be the sphere of radius r in Denote by 14 (r) and 5'„(r) the volume and surface area of 

S^{r) (recall we write S'" for the unit sphere). Fix a base 77 > 1 and let Ssix) be the significand 
function, i.e., S'^dyl) G [1, 77) is the unique number satisfying 

IbI = Ss(|!/|)B‘ (3.2) 


for some /c G Z. 
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Proof of Theorem ll.l 1\ Pick a point x E S'^ uniformly at random, and let xi be the first component 
of X. We are interested in the leading digit distribution of xi. By symmetry, the distribution for 
other eomponents will be similar. Notice in that for 0 < a < 1 


{a:i = a}n5” = 


(3.3) 


Approximating the surface area in the strip {a < xi < b,x E S'”} by a frustum, it follows for 
n > 0 that 

la - xl)dxi 

Prob(a < Xi < b, X E S'”) = - ^ 


^n(l) 

By the familiar relationships S'„(r) = V^{r) = ^^Vn{r), and the closed form solution 

^{n+l)/2^n+l 


^n{r) ^ ^ , 


(3.4) 


(3.5) 


we find 


Prob(a < Xi < b,x E S'”) = 


la ^/^^n-l(Vl-x2) dxi 

(n + l)K(l) 

1 nP(n/2 + 3/2) f^ _ 
SF(n + l)P(n/2 + l)y/^ 


1 P(n/2 + l/2) 
SF P(n/2) 


(3.6) 

Now, fix a, b, where 1 < a < 6 < 5. By symmetry, we may double the digit distribution in the 
positive half-space xi > 0. Thus 


Prob(a < S'b(xi) <b,xE S'”) = 


For example, when n = 1 we have 


2 P(n/2 + l/2) 
TT T{n/2) 


°o pb*B-' 
a*B~^ 


E 

i=l 


(1 — xlY^"^ ^ dxi- (3.7) 

□ 


Sl^J=^^So{^/l-xl)dxi 


Prob(a < Xi < b,x E S'^) = 


^i(l) 


— / — , dxi 

TT Ja \/l — xl 

arcsin(6) — arcsin(a) 


TT 


and thus 


2 / 

Prob(a < Sb{xi) < b,x E S^) = — ( 


arcsm 


71 ^^ \ 
2=1 ^ 


b 


arcsm 


5 ’ 


(3.8) 


(3.9) 
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while for n = 2 


Prob(a < xi < b,x e S‘^) = 




^2(1) 


la ‘^^dXi 

Att 
b — a 


(3.10) 


which implies 


Prob(a < Sb{xi) < 6, x G S'^) = 


B-1 


(3.11) 


The leading digit distribution on S'^ is uniform, with respect to any base, which is akin to the 
fact that equal width slices of a spherical loaf contain the same amount of crust. Our main theorem 
is an asymptotic result. In a sense, the digit distribution is found by applying Stirling’s formula 
and integrating the standard Gaussian. 

Proof of Lemma [77/2] Let a, 6 G M satisfy 1 < a <b < B. Recall, from the above derivation (see 
(13.71) ) that Prob(a < S'b(xi) < 6, x G S'"') equals 


2 P(n/2 + l/2) 
TT P(n/2) 




E 

i=l ■ 


(1 - dx. 


(3.12) 


By Stirling’s approximation 


P(n/2 + 1/2) 
P(n/2) 


0 ( 1 ). 


(3.13) 


Lfsing this with the substitution x = y^^/2/n, dx = dy^j2/n in the integrand yields, for n suffi¬ 
ciently large and x G S'", that 


Prob(a < S'b(xi) <b) = 


+ 0 ( 1 ) 


l + 0( — 
n 


l + 0( — 
n 


E 

i=l ■ 

2 


~ b 

2 fit 


n a 
2 


y 


n/2-l 


n b 
2 


n/2 y M n 

.2 \ 02-1 


dy 


n a 

1 B* 

b 

2 fl* 2 

e-y 

n a 

2 B* 


y 

’ n/2 
dy. 


dy 


(3.14) 


□ 


Lemma [T. 121 can be generalized to many components. Pick a point at random on the unit sphere 
S'" C and consider the first k components xi,... ,Xk {k < n + 1). We are interested 

in the joint distribution of leading digits that appear in the first k components. Similar to the 
analysis above, for a point (oi, 02 ,..., a^) in the open unit disk D^, notice that the other n — k + l 
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components lie in a n—/c sphere of radius — af — • • • — a^. Exploiting the rotational symmetry 

in the last n — k + \ eomponents, we may parametrize the surfaee element dSn of by as 

dSn{xi,...,Xk) = Sn-k{\l^ -xl - xl) dS'fc(Xi, ..., Xfc) 

dSn{xi,...,Xk) = Sn-k{\ll-x\ - xl) dxi ■■■ dxk-(3.15) 

^ V1 - - xi 

Lemma 3.1. Fix an integer k > and let ai^bi,... ,ak,hk G M satisfy 0 < |aj| < \hi\ < 1, 
1 <i < k. For n > k sufficiently large, 

Prob(|ai| < |xi| < \bi\,..., \ak\ < \xk\ < \bk\,x e S'") (3.16) 


is well-approximated by 



(3.17) 


in the sense that the difference between (13.161) and (13.171) tends to zero as n ^ oo. 

Proof. Similar to Lemma [1.12[ we need only worry about when ai,bi > 0. By symmetry and 
substitution 


Prob(ai < Xl < bi,... ,ak < Xk < bk,x ^ S'") 


= 2^ 


dS{xi, ...,Xk) 


' ai<xi<bi,...,ai,<Xk<bk 


Snil) 


fbl pbk j - 

/ •••/ Sn-k{^l-xl - xl) 

J (1\ J CLg. 


2'^{n-k + l) 

(n + l)K(l) 




f ai 








Vn-k{\Jl-xl - xl) 


dxi■■ ■ dxk 


Xl 


V^-xl - 


dxi■■ ■ dxk 


xl 


/ 2 3 

r(n /2 + l/ 2 ) 

■ f 

VViy 

1 r(n/2 - i/2 + 1/2) X, 

J ak 


{1 — xl — ■ ■ ■ — xl)^^ ^ dxi • • ■ dxfc. (3.18) 


Stirling’s approximation 


r(n/2 + 1/2) 
r(n/2 - kl2 + 1/2) 




and the substitutions Xi = Pi/^/n/2 complete the proof. 


(3.19) 

□ 


Corollary 3.2. Fix an integer fc > 0. For any base B > 1, and ai,bi,... ,ak,bk G M satisfying 
1 < Qi < bi < B, 1 < i < k, we have for n > k sufficiently large 


Prob(ai < S's(a;i) < 6i, • • •, < Ss^Xk) <bk,x e S'") 

is well-approximated by 


k oo 




j=i i=i 


TT 


2 S' 2 

e-^ dx 

n “J 
2 SI 


(3.20) 


(3.21) 


12 

























in the sense that the difference between (13.201) and (13.211) tends to zero as n ^ oo. In particular, 
the joint leading digit distribution of the first k components is asymptotically periodic in n, with 
period B^, tending to one of the — 1 limiting distributions 


K K LXJ y 


j=i 


j=l i=—oo 


-erf 
2B 


2 5 * 


i < n < 5 


(3.22) 


4. Conclusions and Future Work 

Our results above ean serve as a means for deteeting underlying symmetries of a physical system. 
For example, imagine we are trying to construct matrices from one of the classical compact groups 
according to Haar measure (see HMezll for a description of how to do this). We can use our digit 
laws as a test of whether or not we are simulating the matrices correctly. It would be interesting to 
generalize the arguments above to other groups of matrices, including those over fields other than 
the reals. 


Appendix A. Linear Lie groups 

A Lie group G C GL(1/) is a group equipped with a differentiable structure such that the 
binary operation G x G ^ G is differentiable. The Lie algebra L{G) may be naturally identified 
with the tangent space Te{G) to the identity. For a direction X G L{G) there is a unique one 
parameter subgroup exp(fX), f G M, in the direction of X and the map exp : L{G) G is a local 
diffeomorphism. Let Eij be the n x n matrix with 1 in the {i,j) entry and zeroes elsewhere. 

The groups in this paper are the following. 

• The general linear group GLji(M) of matrices of nonzero determinant and its Lie algebra 
0 [„(M) of all n X n matrices. 

• The special linear group: SL„(M) = {A G GLffV) \ det A = 1} and its Lie algebra 
sG(M) = {X G gln{i^) I = 0} of traceless matrices. 

• The space of diagonal matrices D C (jL„(M) with nonzero diagonal entries and its Lie 
algebra h of diagonal matrices with entries in R. 

• The space of diagonal matrices determinant Di(R) C (jL„(R) with nonzero diagonal 
entries and its Lie algebra hi of traceless matrices with entries in R. 

• The space of upper triangular matrices U (R) C GEffM) with nonzero diagonal entries and 
its Lie algebra u of upper triangular matrices with entries in R. 

• The space of lower triangular matrices L(R) with nonzero entries and its Lie algebra [ of 
lower triangular matrices with entries in R. 

• Note that 

f ^ J }*<j)) 1- (-^-l) 

where Etj is the n x n matrix with 1 in the (f, j) position and zeroes elsewhere. 

• The orthogonal group: 0(n)(R) = {A G (jL„(R) | A^A = /} and its lie algebra 0n(R) = 
{A G M„(R) I E'^ + F = 0} of skew symmetric matrices 

• The unitary group Un{C) = {U E GL„(C) \ U*U = 1} and its lie algebra u„ = {PF G 
Mn{C) \W + W^ = 0}. 

The complex lie groups GL„(C), 0„(C), U(C), L(C), D(C), Di(C) are defined analogously. 
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Appendix B. Haar measure on SL2(M) is Beneord in each component 


The goal of this section is to provide a geometric proof of Theorem II .Sl in two dimensions. We 
start with a useful, classical result. 

Lemma B.l. The area of the hyperbolic cone 

{{t^t/x) :0<f<l, 0<a<a:<&} (B.l) 


is equal to \og{h/a). 

Proof. The region under the curve 1/x has area log(fe) — log(a) = log(6/a), and one can form the 
sector from this region by first attaching the triangle with corners (0, 0), (a, 0), (a, 1/a) and then 
removing the triangle with comers (0, 0), (6, 0), (6,1/&). Both triangles have area 1/2. □ 

Treating SL 2 (M) as the graph of d = (1 — ac)/h, construct from A C SL„(M) the cone on A to 
the origin. Since the SL 2 (M) action preserves volume, the Haar measure on SL„(M) equals (up to 
a scalar) the volume of the cone on A C SL„(M). This observation forms the basis of the proof. 

Theorem B.2. The (1,1) component o/SL 2 (M) with Haar measure is B-Benford. 

Proof Write SL 2 (M) as 

; ad — 6c = l| . (B.2) 

We give a series of statements that simplify the argument but create no loss of generality. Clearly dg 
is i?-Benford in the (1,1) component if and only if c dg is is i?-Benford in the (1,1) component, 
so we take the Haar measure on SL 2 (M) that was constructed earlier. Let an = a; notice that 
a = 0 is a zero measure subset of (SL 2 (M),/i), so we treat SL 2 (M) as the graph of the function 
d = (6c — l)/a. By symmetry it suffices to prove the theorem when our sequence of compact sets 
Ki lie in SL 2 (M)+ when K = graph(d) (with d = (1 — be) /a), defined over a rectangular domain 
D = [l,x) X [—e, e] x [—e,e]. 

Recall that p{K) = /i(graph(d)) = A(C(graph(d))) is the volume of the cone consisting of all 
line segments from O to the graph of d. Consider the solid S := 5'(graph(d)) bounded below the 
graph of d whose volume is 

A(S') = JJJ ^^ da db dc. (B.3) 

We wish to relate A(C'(graph(d))) to A(S'(graph(d))). By our restriction to positive coordinates, 
we see that d is decreasing along each ray emanating from the origin in a direction of D. As we are 
assuming graph(d) > 0 on Zl, A(C(graph(d)) can be found by appending to S the three pyramidal 
regions whose bases are the (3-dimensional) faces of S, given by 

S' n {a = 1}, S' n {6 = —e}, S' n {c = —e}, (B.4) 

then removing the pyramids whose bases are the faces 

S' n {a = x}, S' n {6 = — e}, S' n {c = e}. 
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jab 
c d 


(B.5) 







The apex for all 6 pyramids is the origin. Thus 

A(C(graph(d))) = A(^) + A(C'(^n{a = 1})) -A(C(^n{a = a:})) 

+ X{C{S n{b = -e})) - X{C{S n {6 = e})) 

+ X{C{S n {c = -e})) - X{C{S n {c = e})). (B.6) 

Reeall that the 4-dimensional volume of a pyramid is 1/4 the volume of the base time the height, 
and the volume of the base of eaeh pyramid is simply the double integral over the appropriate sliee. 
Thus 

f f 1 — 6c 


A(C(graph(d))) = 


ID 



' —eJ —t 

fX /•£ 



'1 J-e 



1 J-e 


da db dc 


l^dcdb-^ 

1 4 

1 + ec , , e 

- dcda —7 

a 4 

db da 

a 4 



' —e 
rx ne 



1 — be 

X 

1 — ec 


'1 J-e 

f*X /•£ 


a 



be 


dc db 
dc da 
db da. 


(B.7) 


1 J-e 


Notice that the second and third terms cancel, and each integral that remains is separable, with 
the same limits of integration on a. If we let F{e) be the quantity 


F(e) = 



' —e J —e 
1 
4 


(1 — be) db dc 

f (—e(l -|- ec) — e(l — ec)) dc 


then 

and so 


(—e(l — be) — e(l — be)) dbj , 

(B.8) 

A(G(graph(d))) = log(a:)F(e), 

(B.9) 

log(x)F(e) 

= log(B)F(.) = 

(B.IO) 

□ 


Theorem B.3. The leading digit law on the determinants o/GLji(M) is B-Benford. 

Proof. Let GL„(R)''' be the group of all invertible n x n matrices with positive determinant. The 
map 

/: GL,,(M)+ ^ M+ X SL„(M) (B.ll) 

given by f{g) = {det{g), {det{g))~^g) is a Lie isomorphism, allowing for a decomposition of the 
Haar measure on GLji(M)’'' as follows: there exists a constant c > 0 such that for any compactly 
supported function f e Gc(GL„(M)+) 

[ fig) detig)-"^ dg = c [ — [ (j){ry)dij'{y). (B.12) 

7GLn(R)+ JM.+ ^ 7sl„(K) 
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As any compact set in GL„(M)''" can be well approximated by eubes of the form [—e,e]K', 
K' G SL„(M) eompaet, the result follows. 

□ 
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